Automatic Algorithm Development Using New Reinforcement Programming Techniques
نویسندگان
چکیده
ion in reinforcement learning. Artificial Intelligence, 112(1-2): 181–211. URBANOWICZ, R. J., and J. H. MOORE. 2009. Learning classifier systems: A complete introduction, review, and roadmap. Journal of Artificial Evolution and Applications, 2009. doi: 10.1155/2009/736398. WATKINS, C. J. 1989. Learning from delayed rewards. Ph.D. thesis, Cambridge University, Cambridge, UK. WHITE, S., T. R. MARTINEZ, and G. RUDOLPH. 2010. Generating three binary addition algorithms using reinforcement programming. In Proceedings of the 48th Annual Southeast Regional Conference (ACMSE ’10). ACM Press: New York. DOI: 10.1145/1900008.1900072 http://doi.acm.org/10.1145/1900008.1900072 WHITE, S. K. 2006. Reinforcement Programming: A New Technique in Automatic Algorithm Development. Master’s thesis, Brigham Young University, Provo, UT. WHITESON, S., and P. STONE. 2006. Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7: 877–917. XU, X., D. HU, and X. LU. 2007. Kernel-based least squares policy iteration for reinforcement learning. IEEE Transactions on Neural Networks, 18(4): 973–992.
منابع مشابه
Shuffled Frog-Leaping Programming for Solving Regression Problems
There are various automatic programming models inspired by evolutionary computation techniques. Due to the importance of devising an automatic mechanism to explore the complicated search space of mathematical problems where numerical methods fails, evolutionary computations are widely studied and applied to solve real world problems. One of the famous algorithm in optimization problem is shuffl...
متن کاملAutomatic ei
This paper describes a general approach for automatically programming a behavior-based robot. New behaviors are learned by trial and error using a performance feedback function as reinforcement. Two algorithms for behavior learning are described that combine techniques for propagating reinforcement values temporally across actions and spatially across states. A behavior-based robot called OBELI...
متن کاملAutomatic ei Sriclhar ahadevan an Jonathan Connell
This paper describes a general approach for automatically programming a behavior-based robot. New behaviors are learned by trial and error using a performance feedback function as reinforcement. Two algorithms for behavior learning are described that combine techniques for propagating reinforcement values temporally across actions and spatially across states. A behavior-based robot called OBELI...
متن کاملNetwork Planning Using Iterative Improvement Methods and Heuristic Techniques
The problem of minimum-cost expansion of power transmission network is formulated as a genetic algorithm with the cost of new lines and security constraints and Kirchhoff’s Law at each bus bar included. A genetic algorithm (GA) is a search or optimization algorithm based on the mechanics of natural selection and genetics. An applied example is presented. The results from a set of tests carried ...
متن کاملGenetic Encoding of Agent Behavioral Strategy
The general framework tackled in this paper is the automatic generation of intelligent collective behaviors using genetic programming and reinforcement learning. We define a behavior-based system relying on automatic design process using artificial evolution to synthesize high level behaviors for autonomous agents. Behavioral strategies are described by tree-based structures, and manipulated by...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Intelligence
دوره 28 شماره
صفحات -
تاریخ انتشار 2012